The CPU for agents.
Overview
NVIDIA Vera is built for the CPU work behind agentic AI and reinforcement learning (RL), including code execution, tool use, sandboxing, analytics, data pipelines, and orchestration beyond the model. As both a host CPU for accelerated systems and a standalone CPU for AI factory workloads, Vera keeps GPUs fed, agents responsive, and training loops moving. With fast, energy-efficient NVIDIA Olympus cores and high-bandwidth LPDDR5X memory, Vera delivers up to 80 percent faster sandbox environment performance than traditional CPU infrastructure, helping AI factories generate more tokens per dollar.
The NVIDIA Vera CPU Rack powers reinforcement learning and agentic AI at AI factory scale. Built on NVIDIA MGX™, it integrates up to 256 Vera CPUs to run over 22.5K concurrent environments.
Highlights
Agentic AI is bottlenecked by traditional CPUs. Across an agent's reasoning loop, the CPU compiles generated code, runs Python tool chains, and analyzes software code. NVIDIA Vera accelerates all three workloads by up to 1.8x over leading x86 CPUs, turbocharging the agentic inner loop to maximize AI factory output.
Relative performance based on measured data, and subject to change. NVIDIA Vera CPU with LPDDR5X performance baselined to latest generation x86 CPU.
Relative performance based on measured data and subject to change. NVIDIA Vera CPU with LPDDR5X performance baselined to latest generation x86 CPU with DDR5 across key CPU memory performance metrics.
Traditional DDR5 forces a tradeoff between bandwidth, efficiency, and serviceability. NVIDIA Vera pairs LPDDR5X memory with SOCAMM, detachable, field-replaceable modules that deliver low-power (LP) efficiency with server-class flexibility and upgradable capacity. The result is 2x the bandwidth, 3x the bandwidth per core of leading x86 CPUs with DDR5, unlocking greater AI factory output at hyperscale and enterprise scale.
Use Cases
Features
Built for the demands of reinforcement learning and agentic AI, NVIDIA Vera combines custom-designed Olympus cores, high-bandwidth LPDDR5X memory, and low-latency NVIDIA Scalable Coherency Fabric (SCF). With NVIDIA NVLink™-C2C connectivity, confidential computing, and full Arm® compatibility, Vera extends across accelerated systems and modern data center environments. Its monolithic compute architecture keeps software environments responsive and data moving efficiently, helping to maximize throughput, energy efficiency, and GPU utilization across AI, analytics, and HPC workloads.
NVIDIA Vera Rubin NVL72 unifies leading-edge technologies from NVIDIA: 72 Rubin GPUs, 36 Vera CPUs, ConnectX®-9 SuperNICs, and BlueField-4 DPUs. It scales up intelligence in a rack-scale platform with the NVLink 6 switch and scales out with NVIDIA Quantum-X800 InfiniBand and Spectrum-X™ Ethernet to power the AI industrial revolution.
Get Started
Sign up for the latest news, updates, and more from NVIDIA.